Goto

Collaborating Authors

 weighted average 0


Predicting Coronary Heart Disease Using a Suite of Machine Learning Models

Al-Karaki, Jamal, Ilono, Philip, Baweja, Sanchit, Naghiyev, Jalal, Yadav, Raja Singh, Khan, Muhammad Al-Zafar

arXiv.org Artificial Intelligence

Coronary Heart Disease affects millions of people worldwide and is a well-studied area of healthcare. There are many viable and accurate methods for the diagnosis and prediction of heart disease, but they have limiting points such as invasiveness, late detection, or cost. Supervised learning via machine learning algorithms presents a low-cost (computationally speaking), non-invasive solution that can be a precursor for early diagnosis. In this study, we applied several well-known methods and benchmarked their performance against each other. It was found that Random Forest with oversampling of the predictor variable produced the highest accuracy of 84%.


Machine Learning Models for the Identification of Cardiovascular Diseases Using UK Biobank Data

Islam, Sheikh Mohammed Shariful, Abrar, Moloud, Tegegne, Teketo, Loranjo, Liliana, Karmakar, Chandan, Awal, Md Abdul, Hossain, Md. Shahadat, Kabir, Muhammad Ashad, Mahmud, Mufti, Khosravi, Abbas, Siopis, George, Moses, Jeban C, Maddison, Ralph

arXiv.org Artificial Intelligence

Machine learning models have the potential to identify cardiovascular diseases (CVDs) early and accurately in primary healthcare settings, which is crucial for delivering timely treatment and management. Although population-based CVD risk models have been used traditionally, these models often do not consider variations in lifestyles, socioeconomic conditions, or genetic predispositions. Therefore, we aimed to develop machine learning models for CVD detection using primary healthcare data, compare the performance of different models, and identify the best models. We used data from the UK Biobank study, which included over 500,000 middle-aged participants from different primary healthcare centers in the UK. Data collected at baseline (2006--2010) and during imaging visits after 2014 were used in this study. Baseline characteristics, including sex, age, and the Townsend Deprivation Index, were included. Participants were classified as having CVD if they reported at least one of the following conditions: heart attack, angina, stroke, or high blood pressure. Cardiac imaging data such as electrocardiogram and echocardiography data, including left ventricular size and function, cardiac output, and stroke volume, were also used. We used 9 machine learning models (LSVM, RBFSVM, GP, DT, RF, NN, AdaBoost, NB, and QDA), which are explainable and easily interpretable. We reported the accuracy, precision, recall, and F-1 scores; confusion matrices; and area under the curve (AUC) curves.


Use GPT-J Prompt Generation with RoBERTa for NER Models on Diagnosis Extraction of Periodontal Diagnosis from Electronic Dental Records

Chuang, Yao-Shun, Jiang, Xiaoqian, Lee, Chun-Teh, Brandon, Ryan, Tran, Duong, Tokede, Oluwabunmi, Walji, Muhammad F.

arXiv.org Artificial Intelligence

The extent is indicated by the percentage of teeth affected by periodontitis at the identified stage. Grading depends on the risk of disease progression associated with the history of disease progression, local and systemic factors. Despite the introduction of new diagnostic terms for periodontal diseases, dental care providers might not be acquainted with them due to the complexity of this new system. This results in clinical documentation lacking accurate and structured diagnosis, or in some cases, no diagnosis being recorded. Inadequate periodontal diagnoses poses a significant threat to patient care quality. An accurate diagnosis is key to the provision of appropriate patient care, outcome assessment and quality improvement efforts. This, in turn, may hinder future care providers from evaluating the patient's condition precisely and providing optimal treatment. Electronic dental records (EDR) have become widely adopted in dental care, providing an opportunity to address the issue of missing diagnoses. EDRs include comprehensive information on a patient's history, clinical examination, diagnosis, treatment, and prognosis


PPG Signals for Hypertension Diagnosis: A Novel Method using Deep Learning Models

Frederick, Graham, T, Yaswant, A, Brintha Therese

arXiv.org Artificial Intelligence

Hypertension is a medical condition characterized by high blood pressure, and classifying it into its various stages is crucial to managing the disease. In this project, a novel method is proposed for classifying stages of hypertension using Photoplethysmography (PPG) signals and deep learning models, namely AvgPool_VGG-16. The PPG signal is a non-invasive method of measuring blood pressure through the use of light sensors that measure the changes in blood volume in the microvasculature of tissues. PPG images from the publicly available blood pressure classification dataset were used to train the model. Multiclass classification for various PPG stages were done. The results show the proposed method achieves high accuracy in classifying hypertension stages, demonstrating the potential of PPG signals and deep learning models in hypertension diagnosis and management.


Overlapping Word Removal is All You Need: Revisiting Data Imbalance in Hope Speech Detection

LekshmiAmmal, Hariharan RamakrishnaIyer, Ravikiran, Manikandan, Nisha, Gayathri, Balamuralidhar, Navyasree, Madhusoodanan, Adithya, Madasamy, Anand Kumar, Chakravarthi, Bharathi Raja

arXiv.org Artificial Intelligence

Hope Speech Detection, a task of recognizing positive expressions, has made significant strides recently. However, much of the current works focus on model development without considering the issue of inherent imbalance in the data. Our work revisits this issue in hope-speech detection by introducing focal loss, data augmentation, and pre-processing strategies. Accordingly, we find that introducing focal loss as part of Multilingual-BERT's (M-BERT) training process mitigates the effect of class imbalance and improves overall F1-Macro by 0.11. At the same time, contextual and back-translation-based word augmentation with M-BERT improves results by 0.10 over baseline despite imbalance. Finally, we show that overlapping word removal based on pre-processing, though simple, improves F1-Macro by 0.28. In due process, we present detailed studies depicting various behaviors of each of these strategies and summarize key findings from our empirical results for those interested in getting the most out of M-BERT for hope speech detection under real-world conditions of data imbalance.


Explainable Event Recognition

Khan, Imran, Ahmad, Kashif, Gul, Namra, Khan, Talhat, Ahmad, Nasir, Al-Fuqaha, Ala

arXiv.org Artificial Intelligence

The literature shows outstanding capabilities for CNNs in event recognition in images. However, fewer attempts are made to analyze the potential causes behind the decisions of the models and exploring whether the predictions are based on event-salient objects or regions? To explore this important aspect of event recognition, in this work, we propose an explainable event recognition framework relying on Grad-CAM and an Xception architecture-based CNN model. Experiments are conducted on three large-scale datasets covering a diversified set of natural disasters, social, and sports events. Overall, the model showed outstanding generalization capabilities obtaining overall F1-scores of 0.91, 0.94, and 0.97 on natural disasters, social, and sports events, respectively. Moreover, for subjective analysis of activation maps generated through Grad-CAM for the predicted samples of the model, a crowdsourcing study is conducted to analyze whether the model's predictions are based on event-related objects/regions or not? The results of the study indicate that 78%, 84%, and 78% of the model decisions on natural disasters, sports, and social events datasets, respectively, are based onevent-related objects or regions.